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Abstract 



The r-parity tensor of a graph is a generalization of the adjacency matrix, where the tensor's entries 
dénote the parity of the number of edges in subgraphs induced by r distinct vertices. For r — 2, it is 
the adjacency matrix with l's for edges and — l's for nonedges. It is well-known that the 2-norm of the 
adjacency matrix of a random graph is 0(y/n). Here we show that the 2-norm of the r-parity tensor is 
at most f(r)y/n log ' r ' n, answering a question of Frieze and Kannan [3] who proved this for r = 3. As 
a conséquence, we get a tight connection between the planted clique problem and the problem of finding 
a vector that approximates the 2-norm of the r-parity tensor of a random graph. Our proof method is 
based on an inductive application of concentration of measure. 

1 Introduction 

It is well-known that a random graph G(n, 1/2) almost surely lias a clique of size (2 + o(l))log 2 n and a 
simple greedy algorithm finds a clique of size (1 + o(l)) log 2 n. Finding a clique of size even (1 + e) log 2 n 
for some e > in a random graph is a long-standing open problem posed by Karp in 1976 [6] in his classic 
paper on probabilistic analysis of algorithms. 

In the early nineties, a very interesting variant of this question was formulated by Jerrum [5] and by 
Kucera [7]. Suppose that a clique of size p is planted in a random graph, Le., a random graph is chosen and 
ail the edges within a subset oî p vertices are added to it. Then for what value of p can the planted clique 
be found efficicntly? It is not hard to see that p > c^/n log n suffices since then the vertices of the clique will 
have larger degrees than the rest of the graph, with high probability [7] . This was improved by Alon et al 
[TJ to p = Çl(y/n) using a spectral approach. This was refined by McSherry [8] and considered by Feige and 
Krauthgamer in the more gênerai semi-random model [2j. For p > lO^/n, the following simple algorithm 
works: form a matrix with l's for edges and —l's for nonedges; find the largest eigenvector of this matrix 
and read off the top p entries in magnitude; return the set of vertices that have degree at least 3p/4 within 
this subset. 

The reason this works is the following: the top eigenvector of a symmetric matrix A can be written as 



maximizing a quadratic polynomial over the unit sphère. The maximum value is the spectral norm or 2-norm 
of the matrix. For a random matrix with 1,-1 entries, the spectral norm (largest eigenvalue) is 0(y/n). 
In fact, as shown by Fùredi and Komlos [HI], a random matrix with i.i.d. entries of variance at most 1 
has the same bound on the spectral norm. On the other hand, after planting a clique of size yfn times a 
sumeient constant factor, the indicator vector of the clique (normalized) achieves a higher norm. Thus the 
top eigenvector points in the direction of the clique (or very close to it). 

Given the numerous applications of eigenvectors (principal components), a well-motivated and natural 
generalization of this optimization problem to an r-dimcnsional tensor is the following: given a symmetric 




tensor A with entries Ak 1 k 2 ...k r , find 

||Aj| 2 = max A(x, ■■■ ,x), 

>:||x||=l 

where 

, . . . , x = y ' Ai 1 i 2 ._i r x ii x i2 . . . x ir . 

iiî2...i r 

The maximum value is the spectral norm or 2-norm of the tensor. The complexity of this problem is open 
for any r > 2, assuming the entries with repeated indices are zéros. 

A beautiful application of this problem was given recently by Frieze and Kannan |3J. They defined the 
following tensor associated with an undirected graph G — ( V, E) : 

Aijk = EijEjkEki 

where Eij is 1 is ij € E and —1 otherwise, i.e., A^k is the parity of the number of edges between k 
présent in G. They proved that for the random graph G„.i/2, the 2-norm of the random tensor A is 0{\fn), 
i.e., 

sup \ AijkXiXjXk < Cy/nlog c n 

where c, C are absolute constants. This implied that if such a maximizing vector x could be found (or 
approximated), then we could find plantcd cliques of size as small as n 1 / 3 times polylogarithmic factors in 
polynomial time, improving substantially on the long-standing threshold of Çl(y/n). 

Frieze and Kannan ask the natural question of whether this connection can be further strengthened by 
going to r-dimensional tensors for r > 3. The tensor itself has a nice generalization. For a given graph 
G = (V,E) the r-parity tensor is defined as follows. Entries with repeated indices are set to zéro; any other 
entry is the parity of the number of edges in the subgraph induced by the subset of vertices corresponding 
to the entry, i.e., 

Akt,...,k r = Yï Ekikj- 

l<-i<j<r 

Frieze and Kannan's proof for r — 3 is combinatorial (as is the proof by Fùredi and Komlos for r = 2), based 
on counting the number of subgraphs of a certain type. It is not clear how to extend this proof. 

Here we prove a nearly optimal bound on the spectral norm of this random tensor for any r. This 
substantially strengthens the connection between the plantcd clique problem and the tensor norm problem. 
Our proof is based on a concentration of measure approach. In fact, we first reprove the resuit for r = 3 
using this approach and then generalize it to tensors of arbitrary dimension. We show that the norm of the 
subgraph parity tensor of a random graph is at most /(r)0(y / n) whp. More precisely, our main theorem is 
the following. 

Theorem 1. There is a constant Ci such that with probability at least 1 — n _1 the norm of the r-dimensional 
subgraph parity tensor A : [n] r — > { — 1, 1} for the random graph G n ^i/2 is bounded by 

||A|| 2 <C 1 M 5l - 1 )/ 2 ^log (3r - 1)/2 n. 

The main challenge to the proof is the fact that the entries of the tensor A are not independent. Bounding 
the norm of the tensor where every entry is independently 1 or —1 with probability 1/2 is substantially easier 
via a combination of an e-net and a Hoeffding bound. In more détail, we approximate the unit bail with a 
finite (exponential) set of vectors. For each vector x in the discretization, the Hoeffding inequality gives an 
exponential tail bound on A(x, . . . , x). A union bound over ail points in the discretization then complètes the 
proof. For the parity tensor, however, the Hoeffding bound does not apply as the entries are not independent. 
Moreover, ail the (™) entries of the tensor are fixed by just the Q) edges of the graph. In spite of this heavy 
inter-dependence, it turns out that A(x, . . . , x) does concentrate. Our proof is inductive and bounds the 
norms of vectors encountered in a certain décomposition of the tensor polynomial. 
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Using Theorem [TJ we can show that if the norm problcm can be solved for tensors of dimension r, one 
can find planted cliques of size as low as Cn 1 / r poly(r, log n). While the norm of the parity tensor for a 
random graph remains bounded, when a clique of size p is planted, the norm becomes at least p T l 2 (using 
the indicator vector of the clique). Therefore, p only needs to be a little larger than n 1 ^ in order for the 
the clique to become the dominant term in the maximization of A(x, ... ,x). More precisely, we have the 
following theorem. 

Theorem 2. Let G be random graph G n> i/2 with a planted clique of size p, and let A be the r-parity tensor 
for G. For a < 1, let T(n, r) be the Urne to compute a vector x such that A(x, . . . , x) > a r \\ A\\2 whp. Then, 
for p such that 

n>p> C a~ 2 r 5 n 1/r log 3 n, 
the planted clique can be recovered with high probability in time T(n, r)+poly(n), where Cq is a fixed constant. 

On one hand, this highlights the benefits of finding an efficient (approximation) algorithm for the tensor 
problem. On the other, given the lack of progress on the clique problem, this is perhaps évidence of the 
hardness of the tensor maximization problem even for a natural class of random tensors. For example, if 
finding a clique of size (^(n 1 / 2 " 6 ) is hard, then by setting a — n 1 / 2r + e / 2 ~ 1 / 4 we see that even a certain 
polynomial approximation to the norm of the parity tensor is hard to achieve. 

Corollary 3. Let G be random graph G n l / 2 with a planted clique of size p, and let A be the r-parity 
tensor for G. Let e > be a small constant and let T[n,r) be the time to compute a vector x such that 
A(x, ...,x) > n 1 / 2+r£ / 2 - r / 4 ||A|| 2 . Then, for 

P > C r 5 n5~ e log 3 n, 

the planted clique can be recovered with high probability in time T(n, r) +poly(n), where Co is a fixed constant. 
1.1 Overview of analysis 

The majority of the paper is concerned with proving Theorem [TJ In Section [2. 11 we first reduce the problem 
of bounding A(-) over the unit bail to bounding it over a discrète set of vectors that have the same value 
in every non-zero coordinate. In Section 12.21 we further reduce the problem to bounding the norm of an 
off-diagonal block of A, using a method of Frieze and Karman. This enables us to assume that if (k%, . . . , k r ) 
is a valid index, then the random variables E^^, used to compute Aj^...^ are independent. In Section [2. 3[ 
we prove a large déviation inequality (Lemma [5]) that allows us to bound norms of vectors encountered in a 
certain décomposition of the tensor polynomial. This inequality gives us a considerably sharper bound than 
the Hoeffding or McDiarmid inequalities in our context. We then apply this lemma to bound \\A\\2 for r = 3 
as a warm-up and then give the proof for gênerai r in Section [3l 

In Section|4]we prove TheoremO We first show that any vector x that cornes close to maximizing A(-) 
must be close to the indicator vector of the clique (Lemma 2]) . Finally, we show that given such a vector it 
is possible to recover the clique f Lemma 1141) . 

2 Preliminaries 

2.1 Discretization 

The analysis of A(x, . . . , x) is greatly simplified when x is proportional to somc indicator vector. Fortunately, 
analyzing thèse vectors is sufficient, as any vector can be approximated as a linear combination of relatively 
few indicator vectors. 

For any vector x, we define x^ to be vector such that x\ + ^ — xi if Xi > and x^ = otherwise. 
Similarly, let x\ ' = Xi if Xi < and x\ = otherwise. For a set S Ç [n], let x S be the indicator vector 
for S, where the ith entry is 1 if i G S and otherwise. 
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Définition 1 (Indicator Décomposition). For a unit vector x, define the sets Si,... and Ti, . . . through the 
récurrences 



5, = \ t e [n] : (x^ -J2^- k X Sk h > 2- 



and 



fe=i 



T i= i6[n]:(xH-^ 2 -^) i <-2- 



k=l 



Lct yo(a;) = 0. For j > 1, let y^\x) = 2~^> X Sj and let î/~ j) (x) = -2^X Tj - We call the set {y {j) (x)}^ the 
indicator décomposition of x. 



Clearly, 



and 



\y (l) (x)\\ < max{||x(+)||,||x(-)||} < 1. 



N 



x- ^2 y {3) ( a 

j=~N 

We use this décomposition to prove the following thcorcm. 
Lemma 4. Let 

U = {k\S\-^ 2 X s : S Ç[n],ke {-1,1}}. 
For any tensor A over [n] r where WAW^ < 1 

max A(x^\...x^) < (2rrlognl) r max A(x^\...,x^) 

x( 1 >,...,x('->eB(0,l) x( 1 >,...,xM£!7 

Proof. Consider a fixed set of vectors x^\ . . . , x^ and let N — \r log 2 n\ . For each i, let 

N 
j=-N 

We first show that replacing x^ with xW gives a good approximation to A(x^ 1 \ . . . , x^). Letting e be 
the maximum différence between an x^ and its approximation, we have that 

n r/2 

max \\x^ — xW Il ~ e < — — 
ie[r] 2r 

Because of the multilinear form of A(-) we have 

r 

\A(x^\...,x^)-A(xW 7 ...,xCr))\ < ^ e V||A|| 



< 



er 



1 — er 

< n-^\\A\\ 

< 1. 

Next, we bound A(x < - 1 \ . . . , x( r )). For convenience, let Y" M = L)f=_ N y^ (x^). Then using the multlinear 
form of A(-) and bounding the sum by its maximum term, we have 

A(x^\...,x^) < (2NY max A(v {1 \ . . . , ijW) 

»(i)eym,..,i.W6i'W 

< (2iV) r max A(v™ , . . . , w (r) ). 

v( r > ,...,v( r Î£U 



□ 
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2.2 Sufficiency of off-diagonal blocks 

Analysis of A(x^ x \ . . . ,x^) is complicated by the fact that ail terms with repeated indices are zéro. Off- 
diagonal blocks of A are easier to analyze because no such terms exist. Thankfully, as Frieze and Kannan 
[3] hâve shown, analyzing thèse off-diagonal blocks suffices. Here we generalize their proof to r > 3. 
For a collection {Vi, V2, ■ ■ ■ , V r } of subsets of [n], we define 



V1X...XV, 



(1) 



E 



.4 



(1)^(2) 



k 1 ...k r X il X 



k 1 GV 1 ,...,k r GV r 



Lemma 5. Let P be the class of partitions of [n] into r equally sized sets V\, . . . , V r (assume wlog that r 
divides n). Let V = Vi X ... X V r . Let A be a random tensor over [n] r where each entry is in [—1, 1] and let 
R Ç B(Q, 1). If for every fixed (Vi, . . . V r ) G P, it holds that 

Pr[ max A\ v {x (1) , . . . , i (r) ) > f(n)} < 5, 
im,...,iWefl 



then 



Â n r/2 

Pr[ max A(x {1 \ . . . , x^ ) > 2r r f (n)} < —— , 
x^^.^xMeR f(n) 



Proof of Lemma El Each r-tuple appears in an equal number of partitions and this number is slightly more 
than a r~ r fraction of the total. Therefore, 



A(i (1) ,...A(i (r) ) 



IP! 



A\ v (xW,...A(xV) 

{v lt -,Vr}eP 
^ Tpî E \a\ v (xW,...A(xW) 



{V!,...,V r }£P 



We say that a partition {Vi, . . . ,V r } is good if 



max A\ v (x (1 \...,x (r) ) < /(n). 

œ< 1 ),...,œ('')6iî. 

Let the good partitions be denoted by G and let G = P \ G. Although the / upper bound does not hold for 
partitions in G, the trivial upper bound of n r / 2 does (recall that every entry in the tensor is in the range 
[-1, 1] and R Ç 5(0, 1)). Therefore 

A(x^\...A(x^)\<r^3f + ^n^). 



Since B[|G|/|P|] = S by hypothesis, Markov's inequality gives 



r\G\ 



ôrf/ 2 



and thus proves the resuit. 



□ 



2.3 A concentration bound 

The following concentration bound is a key tool in our proof of Theorem[TJ We apply it for t — O(N). 

Lemma 6. Let {u^}^ =1 and {v^}(L 1 be collections of vectors of dimension N' where each entry of is 
1 or —1 with probability 1/2 and \\v^\\2 < 1. Then for any t > 1, 



N 



Pr[530 W ' v {i) f >t]< e- t/ls (4y/e^) 



N 



i = l 
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Before giving the proof, we note that this lemma is stronger than what a naive application of standard 
theorems would yield for t — O(N). For instance, one might treat each (u™ -v^) 2 as an independent random 
variable and apply a Hoeffding bound. The quantity (uW • «M) 2 can vary by as much as N', however, so the 
bound would be roughly exp(— et 2 / N N' 2 ) for some constant c. Similarly, treating each Uj as an independent 
random variable and applying McDiarmid's inequality, we find that every Uj can affect the sum by as much 
as 1 (simultaneously) . For instance suppose that every Vj = and every Uj = 1. Then flipping 

would have an effect of \N' - ({N' - 2)/\ r N 7 ) 2 \ « 4, so the bound would be roughly exp(-ci 2 /NN') for 
some constant c. 



Proof of Lemma® Observe that \/YaLi{ u ^ -^M) 2 is the length of the vector whose ith coordinate is 
u b) . . Therefore, this is also équivalent to the maximum projection of this vector onto a unit vector: 



N N N' 

* > («M • «M) 2 = max > > % 
y z—1 i— 1 j— 1 



(i) (*) 



We will use an e-net to approximate the unit bail and give an upper bound for this quantity. Let C be 

the lattice ( — ^=Z ] . 

\2VN ) 



Claim 7. For any vector x, 



Thus. 



Mb < 2 max y ■ x. 

yg£nB(0,3/2) 



N N N' 

. V(uW -t-W) 2 <2 max Vy.VA,- 
\ 1=1 i=i j=i 

Consider a fixed y € £ n 5(0,3/2). Each is 1 or —1 with equal probability, so the expectation for 
each term is zéro. The différence between the upper and lower bounds for a term is 

2\2y j ufv{i) j \=i\y j v(i) j \ 

Therefore, 

JV JV' N N' 

î—l j — 1 i— 1 i— 1 

Applying the Hocffding bound gives that 

N N N' 

Pr[^(w (î) • v^) 2 >t]< Pv[2^y i ^ufv{i) j ^^]< e _t/18 - 

i— 1 i— 1 i— 1 

The resuit follows by taking a union bound over C H -E>(0, 3/2), whose cardinality is bounded according to 
Claim [H □ 

Claim 8. The number of lattice points in Cn 5(0,3/2) is at most (4y / e7r) JV 

Proof of Claim [51 Consider the set of hypercubes where each cube is centered on a distinct point in £ n 
5(0,3/2) and each has side length of (2y / n) _1 . Thèse cubes are disjoint and their union contains the bail 



(i 



B(Q, 3/2). Their union is also containcd in the bail B(0, 2). Thus, 

Vol(B(0,2)) 



|£n 5(0, 3/2)| < 



-N 



7r N/2 9 N 

< — - 2 N N N ' 2 

~ T{N/2 + \) 

□ 

Proof of Claim\2\ Without loss of generality, we assume that x is a unit vector. Let y be the closest point 
to x in the lattice. In each coordinate i, we have \xi — yt\ < (4 v / rî) _1 , so overall \\x — y\\ < 1/4. 
Letting 9 be the angle between x and y, we have 

vy „ i ttt; ( Ik-yll 2 \ 1/2 As 

6= VI- sin 2 9 > 1 1 „ n „ ' , llo1 > 



COSf 



||x||||y|| '" V max{||a; 2 ||, ||y|| 2 }/ V 16' 

Therefore, 

„ „ [Ï5 3 /Ï5 1 

x ■ y > \\y\\\ — > — W — > — . 

y - non V 16 ~ 4 V 16 ~ 2 



□ 



3 A bound on the norm of the parity tensor 

In this section, we prove TheoremQ] First, however, we consider the somewhat more transparent case of 
r = 3 using the same proof technique. 

3.1 Warm-up: third order tensors 

For r = 3 the tensor A is defined as follows: 

Theorem 9. There is a constant Ci such that with probability 1 — 

PII < CiV^log 4 n. 

Proof. Let V\, V2, V3 be a partition of the n vertices and let V = V\ x V2 x V3. The bulk of the proof consists 
of the following lemma. 

Lemma 10. There is some constant C3 such that 

max A\ v (x^,x^,x^)<C 3 Vnlogn 

iP),i( 2 ),i( 3 >e[/ 

wii/i probability 1 — n~ 7 . 

If this bound holds, then Lemma[4]then implies that there is some C2 such that 

max A\ v (x (1 \x {2 \x {z) ) < C 2 \/nlog 4 n. 

And finally, Lemma [S] implies that for some constant Ci 

max A(x (1) ,x (2) ,a; (3) ) < Ci^/nlog 4 ?! 

x (l), x (2) ):c (3) 6S ( 0j l) 

with probability 1 — n for some constant Ci. □ 
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Proof of Lemma \1(A Dcfinc 

Uk = {x £ U : |supp(x)| = fc} 
and consider a fixed n > ni > 112 > n.3 > 1. We will show that 

max A\ v (x m ,x^\x (3) ) < C 3 Vn~\ogn 

(£c( 1 ),œ< 2 ),a:(3))6t/„ 1 xU„ 2 xU„ 3 



(1) 



with probability n for some constant C3. Taking a union bound over the n choices of ni, n2,n 3 then 
proves the lemma. 

We bound the cubic form as 

max A\ v (x (1 \x^\x^) 

(x 1 - 1 ) ,x( 2 ) ,x'- 3 '>)£U„ 1 x(7„ 2 x£/„3 



max 



V Au u u r (1 V 2 V : 

v ; 1 2 3 k 1 ev 1 ,k 2 ev 2 ,k 3 ev 3 



< max 

(a:W,a:(3))el7 Ba xU„ 



\ 



E E A kik 2 k 3 x h 

fciev, \fc 2 eV2,fe3ev 3 



(a)_(3) 
fc 2 x fc 3 



max 

( œ ( 2 ),x( 3 ))et/„ 2 xt/„ 3 



\ E E E kik 2 X k 2) 2 E k2k 3 X k ^ E klk3 J 

\ fci6Vi \fc2eV2 k 3 £V 3 ) 



Note that each of the inner sums (over ki and /C3) are the dot product of a random —1, 1 vector (the E klk2 
and E k2k3 terms) and another vector. Our strategy will be to bound the norm of this other vector and apply 
Lemma |6l 

In more détail, we view the expression inside the square root a 



/ 



E 



4 fcl) 

k 2 



u (k 2 ) C*l*2)( iE (3)) fc3 

^ Ek!k 2 X k2 ^2 E ^k 3 X^Ekyks 

k 2 ev 2 k 3 ev 3 



u( fc 2).tî( fc l fc 2)(a:(3)) 



u (*i).^(fei)( x C 2 ),x( 3 )) 



(2) 



whcre u^ 2 " 1 = E k2k3 and = E klk2 , while 



u ( fcl ^) (:E (3) )fe3= 43) £feife3 



and 



Clearly, the u's play the rôle of the random vectors and we will bound the norms of the u's in the application 
of Lemma [51 

To apply Lemma [5] with k\ being the index i, vfj} — E klk2 above, we need a bound for every k\ G V\ on 
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the norm of v^- kl '(x^ 2 \ x^). We argue 



El^fa E E ^ k s X k 3 E kik 3 
k 2 V ka€Va 



< max max max 

fc 2 Gsupp( 

F? 



- E \y2 E k 2 ks Xks E ki k 3 ) 

fe26supp(x( x 2) \ k 3 / 



Here we used the fact that ||a;( 2 )||oo < % ^ '■ Note that F± is a function of the random variables {Eij} only. 
To bound i<i, we observe that we can apply Lemma[6]to the expression being maximized above, i.e., 



E(E^(4 3 X* 3 ) 



k-2 V &3 

over the index £2, with = Ek 2 k 3 - Now we need a bound, for every k 2 and fci on the norm of the vector 
v (ki **)( x &)^ We argue 



< ii^iile^ 

fe3 &3 

< 1. 

Applying Lemma[6]for a fixed ki,x^ and a^ 3 ) implies 



" 2 fe 2 esupp(x( 2 )) V k a / 



with probability at most 



C 3 n 2 logn W/| n „ 2 



cxp( ^ 5 )(4^) 



Taking a union bound over the |Vi| < n choices of ki, and the at most n n2 n na choices for x^ and a;*- 3 ), we 
show that 

Pr^ 2 > C 3 logn] < exp(- bg - ) (4v^F)" 2 nn n 'n n3 . 

18 

This probability is at most n _10 /2 for a large enough constant C3. 

Thus, for a fixed a^ 2 ) and x^ 3 \ we can apply Lemma[6]to Eqn. [2] with F 2 = C3 logn to get: 

E f E E k!kJ^ E ^fe*-^^!*»)) > i^n lûgn 
k 1 ev 1 \k 2 ev 2 \ k 3 ev 3 )) 

with probability at most exp(— 6*371 logn/18)(4y / e7r)™. Taking a union bound over the at most n n2 n n3 choices 
for x^ 1 and x^ 3 \ the bound holds with probability 

exp(-C 3 nlogn/18)(4V^ : )™ri n2 n n3 < n - w /2 

for large enough constant C3. 
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Thus, we can bound the squared norm: 

max A\ v (x<V,xW,xW) 2 
^ E ( E E ^ ( x< £ E E k2k3 xME klk X) 

kieVi \k 2 £V 2 \ k 3 <£V 3 // 

< C 2 ni log 2 n 

with probability 1 — n -10 . □ 

3.2 Higher order tensors 

Let the random tensor A be defined as follows. 

A kl ,...,k r = Yl Ek ikj 

l<2<j<r 

where E is an n x n matrix where each off-diagonal entry is — 1 or 1 with probability 1/2 and every diagonal 
cntry is 1. 

For most of this section, we will consider only a single off-diagonal cube of A. That is, we index over 
V\ X . . . X V r where Vi are an equal partition of [n]. We dénote this block by A\v- When fcj is used as an 
index, it is implied that fcj € Vi- 

The bulk of the proof consists of the following lemma. 

Lemma 11. There is some constant C3 such that 

max A\ v (x {1 \...,x {r) ) 2 < n(C 3 r logn)^ 1 

xW,...x( r )e!7 

with probability 1 — n~ 9r . 

The key idea is that Lemma [5] can be applied repeatedly to collections of it's and v's in a way analogous 
to Eqn. [2 Each sum over k r , . . . , k 2 contributes a C^r logn factor and the final sum over k\ contributes the 
factor of n. 

If the bound holds, then Lemma 2] implics that there is some C2 such that 

max A\ v (x^\x^,x^) 2 < C^+^n log 2r+( '- 1) n. 

xW,x( 3 ),x( 3 )6B(0,l) 

And finally, Lemma [5] implics that for some constant C\ 

max A{x {1 \x {2 \x^) < C[r 2r+2r+ ^n log 21 ^ 1 n 

xCi),s(2),x(3)eB(0,l) 

= C , [r 5r " 1 n log 31 "" 1 ^ 

with probability 1 — n~ l for some constant C\. 

Proof of Lemma [771 We define the set U k as in Eqn. [TJ It suffices to show that the bound 

max A\ v (x {1 \...,x {r) ) 2 < n(C 3 r logn)^ 1 

(xW,...xM)eU ni x...xU nr 

holds with probability 1 — n~ 10r for some constant C3, since we may then take a union bound over the n r 
choices of n > ni > . . . > n r > 1. 
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For convenience of notation, we define a family of tensors as follows 

*&;::£ = n E ^ ( 3 ) 

where the superscript indexes the family of tensors and the subscript indexes the entries. Note that for every 
k%, ■ ■ ■ , k r € V\ X . . . X V r , we have B^ kl '—' kr > = 1, since the product is empty. 

Note that the tensor B^ kl ----' k ^ dépends only a subset of E. In particular, any such tensor of order r — £ 
will dépend only on the blocks of E 

Fi = {E\ ViXV] :ij<j}. 

Clearly, F r = 0, Fi contains ail blocks, and Fg \ F e+ i — {E\v i xv e+1 '■ * < 
We bound the rth degree form as 

max A\ v (x {1 \...,x {r) ) 

i( 1 ),...,iWe!/„ 1 x...x!/„ r 

max V x?>BW(x< 2 \...x< r h 

xW,...,x( r >eu n ,x...xu nr ^ fel 



< max V BWW 2 ),...iW) 2 . (4) 

x( 2 ) ! ... I Hg[/„,x...x[;„ I . \ 

1 V feiGVi 



Observe that for a gênerai £, 



B ( kl .,...M) {x (i + i)^ . . ^W) = ^ £* lfa+ x« ( * 1 ''" , * <) (s (m) > . . . , x^) kl+1 , (5) 



where 



W (* 1 ,...,fc,) (a .«+D > . . . ja .(D )fc€+i = a; (m) B ( fcl ,..., fc<+1 ) (a; (/ +2 ) ) . . . , x (r))Y[ Ekike+1 . (6) 



e+i 

i<£ 



It will be convenient to think of B^ kl {x^ l+1 \ . . . ,x^) as the dot product of a random vector u^ kt \ 
where ug£ = £ fcffc , +1 and ^i.-.^^+i), . . .,a;W) t , +1 , so that 

B ( kl ,...M)( x it+i) } . , ^W) = u (fe) . „(*!,...,*<) (a .(i+i) > . . . j2 ,M). (7) 
The sum over fci G Vi from Eqn. [4] can therefore be expanded as 

]T ...a;M) a = ^ (^)./')( I ( 2 >..., I M)) 2 . 

fcieVi feievi 

Our goal is to bound \\v^ kl ^ (x^ 2 \ . . . , a^ r ))|| and apply Lemma[51 Notice that for gênerai t 

2 



v^-< k '\ x ( e+i \...,x^: 



2 

1 



< max max 

kx,—,k e œ('+ 1 )el7 n/+1 ...xW £U„ r 

— Y )B^-- k ^\x^ +2 \...,x^) 2 = ff. (8) 
n-e+i — ' 



Note that the quantity fg (define above) dépends only on the blocks Fg+i. 
The following claims will establish a probabilistic bound on f\. 
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□ 



Claim 12. The quantity 

fr-l = 1. 

Proof. Trivially, every B^ kx '-- kr) () 2 = 1. Therefore, for every subset S r C VÇ. such that |SV| = r V 

— Y, B^>~> k ^() 2 = l. 
Ur k r es r 

Claim 13. There is a constant C 3 such that for any £ € 1 ... r — 2 

Pr[/| > C 3 rf! +1 logn] <n~ 12r . 
We postpone the proof of Claim [Î3] and argue that by induction we have that 

fl< (C 3 rlogn) r - 2 

with probability 1 — n~ 12r r > 1 — n~ llr . 
Assuming that this bound holds, 

v {kl) {x {2 \...,x {r) ) < (C 3 rlogn) r - 2 

for ail ki S Vi and By Lemma [6] then 

Y, B^( x ( 2 \...x^) 2 = J2 (« (fcf) ^ (fei) ^ (3) ,---^ (r) )) 2 

fcieVi feieVi 

> n(C 3 r \ogny~ 1 

with probability at most 

P l Ï8 ) ^ eiT ' 

which is at most n~ llr for a suitably large C 3 . 

Altogcther thc bound of the lemma holds with probability 1 — 2n~ llr > 1 — n~ 10r . □ 

Proof of Claim \TR Consider a fixed choice of the following: 1) fci, . . . kg and 2) G U ne+1 , . . . x^ g U nr - 

From Eqn. [8l we have from définition that for every kg+i € Vf+i 

\\v^--^\x^ 2 \...,x^)\\ 2 2 <f 2 +1 . 

Therefore, by Lemma [5] 

B^-> k t+i\xV+ 2 \...,x^) 2 = ]T (^ +1) ^ (fel - fcf+l) (^ +2) ,...,a;W)) 2 

> C 3 rf 2 +1 n i+1 \ogn 

with probability at most 

/ C 3 rn e+1 logn\ — 

V 18 j ( 4 v e7r ) ■ 

Taking a union bound over the choice of ki, . . . kg. (at most n r ), and the choice of x^ +1 ^ G U ni+1 , . . . x^ £ U nr 
(at most n ( ' r ~ 1 ^ nt + 1 ), the probability that 

Il > C 3 rf 2 +l \ogn 



becomes at most 



■ xp ( _ g 3^+ 1 logn- ) (4Vn?; , 



For large enough C3 this is at most n . □ 
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Algorithm 1 An Algorithm for Recovering the Clique 
Input: 

1) Graph G. 

2) Integer p= \P\. 

3) Unit vector x. 

Output: A clique of size of p or FAILURE. 

1. Calculate y~r»"i°g™l ( x ), . . . 7 y^ rl °s n ~] ( x ) as defined in the indicator décomposition. 

2. For each such y^'(x), let S = swpp(y^ (x)) and try the following: 

(a) Find v, the top eigenvector of the 1, —1 adjacency matrix A\sxS- 

(b) Order the vertices (coordinates) such that V\ > . . . > vig\- (Assuming dot-prod is ^/l/2 below) 

(c) For £ = 1 to \S\, repeat up to n 30 logn times: 

i. Select lOlogn vertices Qi a t random from [£]. 

ii. Find Q2, the set of common neighbors of Q\ in G. 

iii. If the set of vertices with degree at least 7p/8, say P' has cardinality p and forms a clique in 
G, then return P' . 

(à) Return FAILURE. 



4 Finding planted cliques 

We now turn to Thcorcm [5] and to the problem of finding a planted clique in a random graph. A random 
graph with a planted clique is constructed by taking a random graph and then adding every and edge between 
vertices in some subset P to form the planted clique. We dénote this graph as G n l / 2 U K p . Letting A be 
the rth order subgraph parity tensor, we show that a vector x S B(0, 1) that approximates the maximum of 
A(-) over the unit bail can be used to reveal the clique, using a modification of the algorithm proposed by 
Frieze and Kannan [3]. 

This implies an interesting connection between the tensor problem and the planted clique problem. For 
symmetric second order tensors (i.e. matrices), maximizing A(-) is équivalent to finding the top eigenvector 
and can be done in polynomial time. For higher order tensors, however, the complexity of maximizing this 
function is open if éléments with repeated indices are zéro. For random tensors, the hardness is also open. 
Given the réduction presented in this section, a hardness resuit for the planted clique problem would imply 
a similar hardness resuit for the tensor problem. 

Given an x that approximates the maximum of A(-) over the unit bail, the algorithm for finding the 
planted clique is given in Alg. 0J The key ideas of using the top eigenvector of subgraph and of randomly 
choosing a set of vertices to "seed" the clique (steps 2a-2d) corne from Frieze-Kannan 3j. The major 
différence in the algorithms is the use of the indicator décomposition. Frieze and Kannan sort the indices 
so that x\ > . . . x n and select one set S of the form S = [j] where ||^4|sxs|| exceeds some threshold. They 
run steps (2a-2d) only on this set. By contrast Alg. U]runs thèse steps on every S — supp(y^) (x)) where 
j = — |Y log n\ , . . . |Y log n] . 

The algorithm succeeds with high probability when a subset S is found such that \Sn P\ > C^/\S\\ôgn, 
where C is an appropriate constant. 

Lemma 14 (Frieze-Kannan). There is a constant C5 such that if S Ç [n] satisfies \S D P\ > C l 5-\/|S'| logn 7 
then with high probability steps a)-d) of Alg. ^jïnd a set P' equal to P. 

To find such an subset S from a vector x, Frieze and Kannan require that X^gp x î — Clogn. Using 
the indicator décomposition, as in the Alg 21 however, reduces this to X^eP 2 ^ — C^Jlogn. Even more 
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importantly, using the indicator décomposition means that only one élément of the décomposition needs to 
point in the direction of the clique. The vector x could point in a very différent direction and the algorithm 
would still succeed. We exploit this fact in our proof of Theorem[5J The relevant claim is the following. 

Lemma 15. Let B' be set of vectors x € B(0, 1) such that 

|supp(y^(a;)) n P\ < Cs^suPP^O))! logn 

for every [Y log n] , . . . , \r log n\ } . Then, there is a constant C[ such that with high probability 

sup A(x, ...,x)< C' 1 r r 5r/2 ^\og 5r/2 n. 
xeB' 

Proof. By the same argument used in the discretization, we have that for any x G B' 

A(x,...,x) < (2\r\ogn]Y max A(x^,...,x^) 

x meYW(x),...x<.^eYM(x) 

< (2rrlogn]) r max A{x {1) , . . . , a: (r) ), (9) 
xm,...x^)eu' 

where 



U' = {|S|- 1/2 X 5 : S Ç [n], \Sn P\ < C 5 ^\S\Î^}. 

Consider an off-diagonal block V\ x . . . x V r - For each i <E 1 . . .r, let Pi = Vi H P and let E4 = Vi\ P. 
Then, breaking the polynomial up as a sum of 2 r terms, each corresponding to a choice of S\ € 

{i\,iîi},...,5 r e {P r ,Rr} gives 

max A\ v (x^,...,x^)<2 r max V A\ SlX ... xSr (x^ , ■ • ■ ,x {r) ). (10) 

xi 1 ) x^eU' x( r > x( r )p[/' — 

""' ' * Si£{Pi,Ri},...,S T e{P r ,Rr} 

By symmetry, without loss of generality we may consider the case where Si — Ri for i = 1 ... r — £ and 
Si = Pi for i — r — l + 1 . . . r for some l. Let V = Ri x . . . x R r —n x P r _£ +1 x . . . x P r . Then, 

^ m if^ A ^ xil) — xir))= s - s n 4? n w^--^, 

fclE-Rl k r _iGR r -i i— l...r— £ i,j'-id^—£ 

where (as defined Eqn. [3]) 

B^-^\x^\...,xn e ... x n 4? n *w 

fcr-j! + i6-Pr-* + i fc r eP r i=r-<+l..T i,3-i,i — i+Kj 

By the assumption that every xW <E U', this value is at most (Cslogn)^ 2 . Thus, 

/i^ 1 ' « (r) )< E •■■ E II 4? n *waaogn)*/ 2 . 

fciG-Ri k r _e£R r ^i i— l...r— £ i,j:i,j<r—£ 

Note that every edge E^fc. above is random, so the polynomial may be bounded according to Lemma 1111 
Altogether, 

max AUlx^,...,^) < (max{C 5 , C 3 } logn) r/2 . 

xW,...,œW e j/' 

Combining Eqn. [9l Eqn. IlOi and applying Lemma [5] complètes the proof with C[ chosen large enough. □ 

Proof of Theorem^ The clique is found by finding a vector x such that A(x, . . . , x) > a r |P| r / 2 and then 
running Algorithm 2] on this vector. Algorithm 2] clcarly runs in polynomial time, so the theorem holds if 
the algorithm succeeds with high probability. 

By Lemma 1141 the algorithm does succeed with high probability when x £ B', i.e. when some S G 
{supp(y— [r logn](a;), . . . , supp(y— [Ylogn"|(a;)} satisfies |5nP| > C 5 -v 1 /|5'| logn. 

We claim x ^ B' with high probability. Othcrwise, for some x G B' , 

A{x, ...,x)> a r p r/2 > C' V 5r / 2 v ^log 3r/2 n. 
This is a low probability event by Lemma [TÏÏ] if Cq > C[ . □ 



max 
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A Proof of Lemma [14 



Here, we give a Frieze and Kannan's proof of Lemma 11 41 for the reader's convenience. First, we show that 
the top eigenvector of ^4|s x s is close to the indicator vector for S PI P. 

Claim 16. There is a constant C such that for every S C [n] where \SC\P\ > C ^/\S\\ogn, the top eigenvector 
v of the matrix A\sxs satisfies 

J2 «< > V\snP\/2 

Proof. The adjacency matrix A can be written as the sum of x P X pT an d a matrix R representing the 
randomly chosen edges. Let u — x SnF I s/\S P\ Suppose that v is the top eigenvector of ^4|sxs and let 
c = u ■ v. Then 

|SnP| 1/2 = A(u,u) 

< A\ S xs(v,v) 



Hence 



= c 2 A\ S xs(u,u) + 2cy/l - c 2 A\ S xs(u,v - eu) + (1 - c 2 )A\ S xs(v - cu,v - eu) 
< c 2 \SnP\^ 2 + 3\\R\ S xsl 



c 2 >l-3- I|jR|sxs| 



By taking a union bound over the subsets S of a fixed size, it follows from well-known results on the 
norms of symmetric matrices ( [H [9] , also Lemma [6J that with high probability 



\\R\sxs\\ =0(y/\S\ logn) 

for every S Ç [n]. Therefore, the theorem holds for a large enough constant C. □ 

Next, we show that the clique is dense in the first n P\ coordinates (ordered according to the top 
eigenvector v). 



Claim 17. Suppose v 1 > . . . > v n and £ î6SnP v, > y/\SCiP\/2. Then for i = 8|S n P\ 

\SnP\ 



\[£}nP\> 



Proof of Claim [T7\ For any integer i, 



> 



5nP, 

1 1 i>i,i£P 



l 



> 



\snP\ 



\snP\ 



K iEP i<i,i£P 

(V\snP\/2-^/m^P 



Thus, 

LSnPI 
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Taking l = 8\S H P| (optimal), we have 

vtwt^^v^ïïpt 

□ 

Given this density, it is possible to pick lOlogn vertices from the clique and use this as a seed to find the 
rest of the clique. When £ = 8\S n P|, in each itération there is at least a 

g- lOlogn _ n -30 

chance that Q\ Ç P. With high probability, no set of lOlogn vertices in P has more than 21ogn common 
ncighbors outside of P in G. The contrary probability is 

( \P\ \( n \ 2 -20i° g 2 « = o m 

VlOlogn/ V21ogny V h 

Letting Q2 be the common neighbors of Qi in G, it follows that Q2 2 P and IQ2 \ P\ < 21ogn. Now, with 
high probability no common neighbor has degree more than 3|P|/4 in P, because 

/ |P| \fn \ (_|p| /2 4) = o(l). 
VlOlogn/ V21ogny PV 1 17 ; V ' 

for |P| > 3121og 2 n. 

Thus, with high probability no vertex outside of P will have degree greater than 7|P|/8 in the subgraph 
induced by Q2- 
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